Theory and Applications of Agnostic PAC-Learning with Small Decision Trees

نویسندگان

  • Peter Auer
  • Robert C. Holte
  • Wolfgang Maass
چکیده

We exhibit a theoretically founded algorithm T2 for agnostic PAC-learning of decision trees of at most 2 levels, whose computation time is almost linear in the size of the training set. We evaluate the performance of this learning algorithm T2 on 15 common “real-world” datasets, and show that for most of these datasets T2 provides simple decision trees with little or no loss in predictive power (compared with C4.5). In fact, for datasets with continuousattributes its error rate tends to be lower than that of C4.5. To the best of our knowledge this is the first time that a PAC-learning algorithm is shown to be applicable to “real-world” classification problems. Since one can prove that T2 is an agnostic PAClearning algorithm, T2 is guaranteed to produce close to optimal 2-level decision trees from sufficiently large training sets for any (!) distribution of data. In this regard T2 differs strongly from all other learning algorithms that are considered in applied machine learning, for which no guarantee can be given about their performance on new datasets. We also demonstrate that this algorithm T2 can be used as a diagnostic tool for the investigation of the expressive limits of 2-level decision trees. Finally, T2, in combination with new bounds on the VC-dimension of decision trees of bounded depth that we derive, provides us now for the first time with the tools necessary for comparing learning curves of decision trees for “real-world” datasets with the theoretical estimates of PAClearning theory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distribution-Specific Agnostic Boosting

We consider the problem of boosting the accuracy of weak learning algorithms in the agnostic learning framework of Haussler (1992) and Kearns et al. (1992). Known algorithms for this problem (BenDavid et al., 2001; Gavinsky, 2002; Kalai et al. , 2008) follow the same strategy as boosting algorithms in the PAC model: the weak learner is executed on the same target function but over different dis...

متن کامل

Agnostic PAC Learning Decision Lists is Hard

Agnostic PAC Learning Let X be the set of individuals and H a set of predicates over X . A learning algorithm L is said to be a agnostic PAC learning algorithm for H if it satisfies the following: given any , δ ∈ (0, 1), there is an integer m( , δ) such that for all m ≥ m( , δ), for any t ∈ H and any probability distribution μ on X × {0, 1}, with probability at least 1 − δ, given a sample of si...

متن کامل

Representation, Approximation and Learning of Submodular Functions Using Low-rank Decision Trees

We study the complexity of approximate representation and learning of submodular functions over the uniform distribution on the Boolean hypercube {0, 1}. Our main result is the following structural theorem: any submodular function is -close in `2 to a real-valued decision tree (DT) of depth O(1/ ). This immediately implies that any submodular function is -close to a function of at most 2 ) vari...

متن کامل

(Agnostic) PAC Learning Concepts in Higher-Order Logic

This paper studies the PAC and agnostic PAC learnability of some standard function classes in the learning in higher-order logic setting introduced by Lloyd et al. In particular, it is shown that the similarity between learning in higher-order logic and traditional attributevalue learning allows many results from computational learning theory to be ‘ported’ to the logical setting with ease. As ...

متن کامل

COMS 6253 : Advanced

Previously: • Administrative basics, introduction and high-level overview • Concept classes and the relationships among them: DNF formulas, decision trees, decision lists, linear and polynomial threshold functions. • The Probably Approximately Correct (PAC) learning model. • PAC learning linear threshold functions in poly(n, 1/ , log 1/δ) time • PAC learning polynomial threshold functions. Toda...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995